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Background. Recent data suggest that human coronavirus (HCoV) pneumonia is associated with significant mortality in hema- 
topoietic cell transplant (HCT) recipients. Investigation of risk factors for prolonged shedding and intrahost genome evolution may 
provide critical information for development of novel therapeutics. 

Methods. We retrospectively reviewed HCT recipients with HCoV detected in nasal samples by polymerase chain reaction 
(PCR). HCoV strains were identified using strain-specific PCR. Shedding duration was defined as time between first positive and first 
negative sample. Logistic regression analyses were performed to evaluate factors for prolonged shedding (221 days). Metagenomic 
next-generation sequencing (mNGS) was conducted when 24 samples with cycle threshold values of <28 were available. 

Results. Seventeen of 44 patients had prolonged shedding. Among 31 available samples, 35% were OC43, 32% were NL63, 19% 
were HKU1, and 13% were 229E; median shedding duration was similar between strains (P = .79). Bivariable logistic regression 
analyses suggested that high viral load, receipt of high-dose steroids, and myeloablative conditioning were associated with prolonged 
shedding. mNGS among 5 subjects showed single-nucleotide polymorphisms from OC43 and NL63 starting 1 month following 


onset of shedding. 
Conclusions. 


High viral load, high-dose steroids, and myeloablative conditioning were associated with prolonged shedding of 


HCoV in HCT recipients. Genome changes were consistent with the expected molecular clock of HCoV. 
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Respiratory viruses are associated with prolonged shedding, 
higher rates of lower respiratory tract disease, and mortality in 
hematopoietic cell transplant (HCT) recipients. Development of 
novel therapeutics and effective infection prevention has been 
critically important, especially for well-established respiratory 
viruses such as respiratory syncytial virus, influenza virus, and 
parainfluenza viruses [1-4]. With new molecular diagnostics 
widely available, similar concerns have been raised for other 
respiratory viruses including human coronavirus (HCoV) [5]. 
In addition to the demonstration of frequent prolonged shed- 
ding of HCoV after HCT [6], recent data suggest that com- 
mon HCoVs (229E, OC43, NL63, and HKU1) are important 
respiratory pathogens related to significant mortality in HCT 
recipients [7]. Data on host and virologic factors associated with 
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prolonged shedding, including genome evolution within a host, 
may provide a rationale for the development of antiviral therapy 
at various stages, but are currently lacking. Therefore, we exam- 
ined HCT recipients to define viral and host factors associated 
with prolonged HCoV shedding in the upper respiratory tract 
and examine evolution of viral genomic sequences over time by 
metagenomic next-generation sequencing (mNGS). 


METHODS 


Study Design 

We reviewed HCT recipients with HCoV detected in nasal 
samples by multiplex respiratory viral polymerase chain reac- 
tion (PCR) at the Fred Hutchinson Cancer Research Center. 
Subjects were required to have a negative viral PCR test within 
2 weeks of the last positive virology testing performed. If the 
interval between consecutive positive tests was beyond 2 weeks, 
strain identification was performed using both samples to 
confirm the strains were the same. The subjects were identi- 
fied from 2 cohorts (Supplementary Figure). The first cohort 
included patients whose nasal samples were collected and tested 
for clinical purposes when respiratory symptoms were pres- 
ent from March 2009 through June 2016. The second cohort 
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came from a prospective surveillance study of HCT recipients 
undergoing transplant from December 2005 and February 2010 
[8]. Standardized respiratory symptom surveys and multiplex 
respiratory PCR tests were performed weekly during the first 
100 days posttransplant, then every 3 months through year 1 
posttransplant and whenever respiratory symptoms occurred 
between days 100 and 365 posttransplant. Only subjects with 
respiratory symptoms were selected for the current study, and 
no duplicated subjects were analyzed. Separately, mNGS was 
conducted when 24 positive samples with cycle threshold (Ct) 
values of <28 were available irrespective of presence of respira- 
tory symptoms from the above-mentioned prospective surveil- 
lance study of HCT recipients. Demographic and clinical data 
were collected from the database and medical chart review. The 
study was approved by the Institutional Board Review at Fred 
Hutchinson Cancer Research Center. 


Laboratory Testing and Definitions 

HCoV detection and viral load were determined from nasal 
specimens by quantitative reverse-transcription PCR as part 
of a multiplex PCR used to detect 12 respiratory viruses. 
Strain-specific PCR was performed using saved nasal samples 
according to a previously published protocol [9]. mNGS was 
performed on DNAse I-treated RNA extracts from 0.45-uM 
filtered nasal specimens using “tagmented” (transposon-me- 
diated fragmentation) cDNA libraries with 15-20 cycles of 
PCR amplification when 24 samples with Ct values of <28 
were available [10]. Sequence reads were trimmed using cut- 
adapt and aligned to a concatenation of the 4 HCoV refer- 
ence genomes (NC_002645, NC_005831, NC_006577, and 
KF530069) using Geneious version 9.1 [11]. The duration of 
shedding was defined as time between the first positive and 
first negative sample. Prolonged shedding was defined as the 
duration of shedding 221 days, which was described to be a 
median shedding duration of HCoV during the first 100 days 
after HCT [6]. Highest daily steroid dose and lowest cell count 
in the 2 weeks prior to first HCoV detection were recorded. 
Conditioning regimen was categorized into myeloablative and 
nonmyeloablative/reduced intensity based on the definition 
previously described [12]. 


Statistical Analysis 

Univariable and bivariable logistic regression analyses were 
performed to evaluate associations between virologic and 
host factors and prolonged shedding. Only the first episode of 
HCoV infection per subject was used for the outcome analyses. 
Variables with P < .2 in the univariable models were candidates 
for bivariable models. Kruskal-Wallis test was performed to 
compare continuous values among more than 2 groups. Two- 
sided P values <.05 were considered statistically significant. All 
statistical analyses were performed using SAS 9.4 for Windows 
(SAS Institute, Cary, North Carolina). 


RESULTS 


Host and Virological Characteristics 

We identified 20 and 24 HCT recipients with respiratory HCoV 
infection from cohort 1 and cohort 2, respectively (42 adult and 
2 pediatric patients) (Table 1 and Supplementary Figure). The 
median duration of shedding was 14 days (4-60 days), and 17 
patients had prolonged shedding (221 days). Among 31 avail- 
able nasal samples, 35% were OC43, 32% were NL63, 19% were 
HKU1, and 13% were 229E. The median shedding duration of 
HCoV in nasal samples did not differ between strains (Figure 1; 
P=.79). 


Outcome Analyses 

Initial high viral load (Ct value below the median) was associ- 
ated with prolonged shedding with the lowest P value (<.01) by 
univariable analysis. Univariable and bivariable logistic regres- 
sion analyses indicated that initial high viral load was associated 
with prolonged shedding consistently in all models (‘Table 2). 
High-dose steroid use (21 mg/kg/day) prior to HCoV diagnosis 
and myeloablative conditioning regimen were associated with 
prolonged shedding in the bivariable analyses. Four patients 
started viral shedding prior to transplant; therefore, we sepa- 
rately analyzed 40 patients who started shedding after trans- 
plant, and the results remained similar (data not shown.) 


Whole-Genome Sequencing 

Whole genomes of OC43, NL63, and HKU1 were consecutively 
sequenced in samples from 4 HCT adult subjects and 1 pediat- 
ric subject where samples were available for 19 to 132 days fol- 
lowing the first positive sample (Table 3). Engraftment occurred 
in 4 patients prior to the start of shedding. No majority consen- 
sus variants were recovered for any patient <30 days after onset 
of shedding. Single-nucleotide variants accumulated at a rate of 
approximately 1 variant per 3-4 weeks, consistent with previous 
estimates of the HCoV molecular clock (Figure 2) [13, 14]. No 
single-nucleotide polymorphisms (SNPs) of OC43 and HKU1 
were recovered in patients 4 and 5, respectively. One adult 
patient (patient 3) developed lower respiratory tract disease in 
the setting of high-dose steroid use for acute graft-vs-host dis- 
ease during prolonged shedding. Bronchoalveolar lavage was 
performed at day 73 after starting the shedding, from which 
Aspergillus fumigatus was detected in addition to HCoV OC43. 
One 18-year-old pediatric patient (patient 4) had 3 different 
HCoV strains detected in succession over a period of 5 months. 


DISCUSSION 


In this study, we demonstrated a significant association between 
prolonged shedding of HCoV and initial high viral load in 
transplant recipients. In addition, prior high-dose steroid use 
and myeloablative conditioning regimen appear to be asso- 
ciated with prolonged shedding. The duration of shedding 
appeared to be similar across all 4 HCoV strains. No drastic 
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Table 1. Clinical Features of Patients With Human Coronavirus Upper Respiratory Tract Disease 


Characteristic Total (N = 44) 
Female sex 18 (41) 
Age, y, median (range) 54 (7-73) 
Transplant number >2 12 (27) 
Cell source 

Cord 3 (7) 

Bone marrow 4 (9) 

PBSC 37 (84) 
Donor type 

Autologous 3 (7) 

Related 21 (48) 

Unrelated 20 (45) 
Conditioning regimen 

Myeloablative 18 (41) 

MA or RIC 26 (59) 

Onset of shedding relative to transplant 

Pretransplant 4 (9) 

0-100 days posttransplant 22 (50) 

>100 days posttransplant 18 (41) 
Human coronavirus strains 

OC43 11 (25) 

NL63 10 (23) 

229E 4 (9) 

HKU1 6 (14) 

Unknown 13 (30) 
Ct value, median (range) 28.3 (19.2-39.4) 
Lowest WBC count <1000 x 10° cells/L? 14/36 (39) 
Lowest lymphocyte count <300 x 10® cells/L# 18/36 (50) 
Lowest neutrophil count <500 x 10° cells/L# 13/36 (36) 
Lowest monocyte count <100 x 10° cells/L? 16/36 (44) 
Highest daily steroid dose# 

None 26 (59) 

<1 mg/kg 12 (27) 

>1 mg/kg 6 (14) 


Patients With Prolonged Shedding (n = 17) 


Patients With Short-term Shedding (n = 27) 


7 (41) 11 (41) 
54 (7-67) 55 (14-73) 
6 (35) 6 (22) 
2 (12) 1 (4) 
2 (12) 2 (7) 
13 (76) 24 (89) 
1 (6) 2 (7) 
6 (35) 15 (56) 
10 (59) 10 (37) 
10 (59) 8 (30) 
7 (41) 19 (70) 
4 (23) 0 
6 (35) 16 (59) 
7 (41) 11 (41) 
3 (18) 8 (30) 
5 (29) 5 (19) 
2 (12) 2 (7) 
2 (12) 4 (15) 
5 (29) 8 (30) 
26.1 (19.2-39.4) 28.8 (19.6-39.4) 
3/15 (20) 11 /21 (52) 
6 /15 (40) 12/21 (57) 
3/15 (20) 10 /21 (48) 
5/15 (33) 11/21 (52) 
8 (47) 18 (67) 
5 (29) 7 (26) 
4 (24) 2 (7) 


Data are presented as No. (%) unless otherwise indicated. 


Abbreviations: Ct, cycle threshold; NMA, nonmyeloablative; PBSC, peripheral blood stem cell; RIC, reduced intensity; WBC, white blood cell. 


4ln the 2 weeks prior to first human coronavirus detection. 


80 
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Figure 1. Duration of shedding according to human coronavirus strain. The bars 
indicate median values and first and third quartiles (P= .79 by Kruskal-Wallis test). 


intrahost evolution of viral genomes occurred in this immuno- 
compromised population with prolonged shedding. 

Severe acute respiratory syndrome and Middle East respira- 
tory syndrome (MERS) coronaviruses are recognized as highly 
human-pathogenic coronaviruses causing fatal lower respira- 
tory tract disease [15-17]; however, there are no established 
antiviral therapies [18, 19]. Recent data suggest that lower 
respiratory tract disease caused by 4 other HCoV strains (229E, 
OC43, NL63, and HKU1) was also associated with significant 
respiratory support and mortality in immunocompromised 
hosts [7]. The unmet need for the development of antiviral ther- 
apy against HCoV is expected to expand as immunocompro- 
mised populations grow. We found that initial high viral load, 
prior high-dose steroid use, and myeloablative conditioning 
were important factors associated with prolonged HCoV shed- 
ding in HCT recipients. Duration of viral shedding is often used 
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Table 2. Univariable and Bivariable Logistic Regression Analyses for Prolonged Shedding (n = 44)? 


Univariable Model 


Bivariable Model 1 


Bivariable Model 2 Bivariable Model 3 Bivariable Model 4 


OR Adjusted OR Adjusted OR Adjusted OR Adjusted OR 
Covariates Categories (95% Cl) PValue (95% Cl) PValue (95% Cl) PValue (95% Cl) PValue (95% Cl) PValue 
Ct value <28.3 vs 228.3 6.5 (1.6—-26) <.01 11.6 (2.1-64.7) <.01 11.0 (2.1-58.8) <.01 5.1 (1.0-25.2) .05 5.5 (1.1-26.8) .03 
Conditioning MA vs NMA/RIC _ 3.4 (.95-12) .06 6.9 (1.3-38) .03 
regimen 
Highest steroid 21 vs <1 3.85 (.6-24) alls) 10.1 (1.1-96) 05 
dose® (mg/kg/ 
day) 
Lowest WBC <1.0 vs >1.0 0.23 (.05-1.05) .06 0.33 (.06-1.7) .18 
count? (x10 
cells/L) 
Lowest neutrophil <0.5 vs 20.5 0.28 (.06-1.3) 10 0.37 (.07-1.9)  .23 
count? (x 10° 
cells/L) 
Lowest lympho- <0.3 vs 20.3 0.5 (.13-1.9) 31 
cyte count® 
(x108 cells/L) 
Lowest monocyte <0.1 vs 20.1 0.45 (.12-1.8) .26 
count? (x10® 
cells/L) 
Transplant number 22 vs <1 1.91 (.5-73) 35 
Age at diagnosis Ascontinuous 0.98 (.94—-1.03) A8 


Abbreviations: Cl, confidence interval; Ct, cycle threshold; MA, myeloablative; NMA, nonmyeloablative; OR, odds ratio; RIC, reduced intensity; WBC, white blood cell. 


4Variables with P< .2 in the univariable models were candidates for bivariable models where data only support inclusion of 2 factors per model. 


®In the 2 weeks prior to first human coronavirus detection. 


as an endpoint at early stages of clinical trials for new antiviral 
drugs [20, 21]. Stratification based on risk factors is critical to 
avoid imbalances due to host and viral factors in randomized 
trials, which might otherwise mask true differences of experi- 
mental agents. 


Table 3. Specimens Sequenced in this Study 


Genetic variability of HCoV OC43 at the community level 
and intrahost heterogeneity of MERS coronavirus have been 
reported [22-25]. Such variability might have important impli- 
cations in viral disease pathogenesis, and the study of viral 
genome evolution within a host can provide vital information 


Patient Strain Species Day® Ct Values HCoV Reads Total Reads Accession 
1 NO7-196B NL63 -10 26.1 291 765 465084 KY554969 
07-262B NL63 4 25.9 72640 249121 KY829118 

NO7-468B NL63 42 26.2 59658 191697 KY554971 

2 NO6-1144B NL63 48 25.7 341 740 428487 KY554967 
NO7-6B NL63 64 277 67223 342554 KY674915 

07-64B NL63 79 24.5 136330 239 866 KY674916 

NO7-185B NL63 107 24.1 30368 79584 KY554968 

07-324B NL63 135 25 56685 542 950 KY554970 

8 NO9-33B OC43 g) VAR) 128622 213882 KY554974 
09-382B OC43 77 271 8063 137 198 KY554975 

09-595B OC43 118 276 6297 1044785 KY674920 

4 07-1541B OC43 116 23.8 160033 580450 KY554972 
07-1609B OC43 130 24.7 43692 221874 KY674917 

07-1647B OC43 137 22.7 107905 147 986 KY674918 

N08-87B HKU1 174 27 2ST 737630 KY674921 

08-434B 229E 248 272 12019 2981 384 KY674919 

5 NO9-1605B HKU1 29 20.8 1636757 2936518 KY674943 
09-1627B HKU1 36 23.5 491 949 877 155 KY674942 

NO9-1663B HKU1 47 278 9164 266311 KY674941 


The number of HCoV reads, total reads, Ct values, and accession number are depicted for each of the specimens for which whole genomes were recovered. 


Abbreviations: Ct, cycle threshold; HCoV, human coronavirus. 
@Day is relative to engraftment. 


206 + JID 2017:216 (15 July) + Ogimiet al 


Day 0 
Day 14 
S$4421A U714A 
Day 52 | | 
91%U 98%U 
9%G 2%C 
NL63 | MN } 
) 
Day 0 
Day 16 
GH51D 
Day 31 l 
85%A 
15%G 
L1246F_ F107I 
AT1LV 
Day 59 — | IE i 
94%G 53%U 75%U 78%A 
6%A 47%C 25%C 22%U 
GH51D F1071 
A38968 —C.14484U V3681 ATLV 
Day 87 a 
80%U —-53%U 63%A  60%A 87% A 
20%G 47% C 37%G  40%G 8%C/5%U 
NL63 ® 
) 
Day 0 
L265R 
Day 68 ae | 
78%U 82%G STAG 0G 
22%C 18%A 19%U 25%U 
T2836A 
P2841 U8464C L265R C26326U 
Day 109 | Il | | 
78%U 53%C 67%A 75%U 
22%C 47%U 33%G 25%C 
0C43 replicase replicase PYHEY ike YN) 


’® 


Figure 2. Genome evolution of human coronavirus over time. Single-nucleotide variants in the consensus genome for patient 1 (A), 2 (B), and 3 (C) are depicted. Patients 
4 and 5 showed no variants over time for the same coronavirus species (Table 3). Both nonsynonymous and synonymous variants are depicted relative to the day 0 genome 
recovered. Majority consensus nonsynonymous changes at allele frequency >50% are indicted in red, while majority consensus synonymous changes at allele frequency 
>50% are shown in green. Allele frequencies for variant sites are depicted when the majority allele had <95% frequency. Nonsynonymous changes are shown for the amino 
acid of a given gene, while synonymous changes are shown for the nucleotide of that gene. Allele frequencies alone are shown when the majority consensus base is not 
a variant relative to the day 0 genome but the majority allele frequency at that base is <95% (in flux). Abbreviations: HE, hemagglutinin esterase; M, membrane protein; N, 


nucleocapsid protein. 


important in developing and assessing antiviral agents [26]. For 
example, development of antiviral drug resistance during indi- 
vidualized therapy that is associated with poor outcome has been 
described extensively with influenza virus [27-29]. Grad et al 
reported intrahost genome evolution of respiratory syncytial 
virus over time in an infant with severe combined immune defi- 
ciency who underwent a bone marrow transplant [30]. The viral 
population diversity dramatically increased after engraftment, 
which appeared to reflect dynamic response to immune pres- 
sure from host immunity. To our knowledge, no previous data 
exist describing how the HCoV genome evolves within a host 
over time. In the current study, engraftment occurred in 4 of 5 
patients sequenced prior to the onset of shedding. Interestingly, 


no SNPs were recovered <30 days after the onset of shedding 
even after immune reconstitution. Variants accumulated start- 
ing at 1 month after the onset of shedding (1-6 changes over 
time), consistent with the previously estimated evolution rates 
of HCoV [13, 14]. Given the relatively slow evolution rate of 
coronaviruses, these observations could be promising from the 
standpoint of antiviral resistance and therapeutic development 
[13, 14]. Due to their exceptionally large RNA genomes, corona- 
viruses are known to encode highly processive polymerases as 
well as proofreading exoribonucleases that temper viral genome 
evolution relative to other RNA viruses [31-33]. Further epide- 
miological and biochemical work is required to characterize the 
functional impact of the variants recovered here. 
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The main limitation of this study was the relatively small 
sample size; thus, bivariable logistic regression analyses were 
performed instead of multivariable analyses to evaluate risk 
factors for prolonged shedding. Similarly, although no partic- 
ular strain appeared to be associated with prolonged shedding, 
strain identification using saved samples was successful in only 
70% of the patients, which limited our ability to detect small 
difference of shedding duration among each HCoV strain. 
Further studies with larger sample sizes will help to clarify 
the distinct association between particular HCoV strains and 
prolonged shedding. Finally, our cohort included 4 patients 
who had documented HCoV shedding prior to transplant. 
Considering the unmeasured influence of their different back- 
grounds on our analyses, we separately analyzed 40 patients 
who started shedding after transplant. Only univariable logistic 
regression analysis could be performed due to the small sample 
size, with similar results. 

This is the first study to evaluate risk factors associated with 
prolonged shedding of HCoV by quantitative and strain-specific 
reverse transcription PCR as well as intrahost genomic evolu- 
tions by metagenomic RNA sequencing in transplant recipients. 
Our study provides critical information to develop antiviral 
therapies and design randomized trials with viral load end- 
points. In addition, as the duration of shedding is an important 
determinant of viral infectivity and transmissibility, predictive 
factors for prolonged shedding may provide useful information 
for effective infection control, such as the expected duration of 
isolation. Further studies are needed to validate the risk factors 
including particular HCoV strain for prolonged shedding. 


Supplementary Data 


Supplementary materials are available at The Journal of Infectious Diseases 
online. Consisting of data provided by the authors to benefit the reader, the 
posted materials are not copyedited and are the sole responsibility of the 
authors, so questions or comments should be addressed to the correspond- 
ing author. 
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